Open Issues on Codebook Generation in Image Classification Tasks

Authors

  • Luca Piras
  • Giorgio Giacinto
Abstract

In recent years, the so-called bag-of-features approach, often also referred to as the codebook approach, has gained wide popularity among researchers in the image classification field because of the high levels of performance it has exhibited. A large variety of image classification, scene recognition, and, more generally, computer vision problems have been addressed according to this paradigm in the recent literature. Although some papers have questioned the real effectiveness of the paradigm, most works in the literature follow the same approach to codebook creation, making it a de facto standard, without any critical investigation of the suitability of the employed procedure to the problem at hand. The most widespread structure for codebook creation is made up of four steps: detection of image patches by dense sampling; use of SIFT as the patch descriptor; use of the k-means algorithm to cluster the patch descriptors in order to select a small number of representative descriptors; and use of an SVM classifier, where images are described by a codebook whose vocabulary is made up of the selected representative descriptors. In this paper, we focus on a critical review of the third step of this process, to see whether the clustering step is really useful for producing effective codebooks for image classification tasks. The reported results clearly show that a codebook created by purely random extraction of patch descriptors from the set of descriptors extracted from the images in a dataset improves classification performance with respect to codebooks created by the clustering process.
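The contrast drawn in the abstract, between selecting codewords by k-means clustering and selecting them by purely random extraction, can be illustrated with a small sketch. This is not the authors' implementation: the abstract gives no code, so the function names (`random_codebook`, `kmeans_codebook`, `encode`), the toy 8-dimensional random vectors standing in for SIFT descriptors, the codebook size, and the plain Lloyd-style k-means are all assumptions made for illustration. The dense sampling, SIFT extraction, and SVM steps are omitted.

```python
import random

def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def nearest(vec, codebook):
    """Index of the codeword closest to vec."""
    return min(range(len(codebook)), key=lambda j: sq_dist(vec, codebook[j]))

def random_codebook(descriptors, k, rng):
    """Pick k descriptors uniformly at random, without replacement."""
    return [list(d) for d in rng.sample(descriptors, k)]

def kmeans_codebook(descriptors, k, rng, n_iter=10):
    """Plain Lloyd-style k-means, initialised from k random descriptors."""
    centers = random_codebook(descriptors, k, rng)
    dim = len(descriptors[0])
    for _ in range(n_iter):
        sums = [[0.0] * dim for _ in range(k)]
        counts = [0] * k
        for d in descriptors:
            j = nearest(d, centers)
            counts[j] += 1
            for t in range(dim):
                sums[j][t] += d[t]
        for j in range(k):
            if counts[j] > 0:  # keep the old center if a cluster is empty
                centers[j] = [s / counts[j] for s in sums[j]]
    return centers

def encode(image_descriptors, codebook):
    """Normalised histogram of nearest-codeword counts for one image."""
    hist = [0.0] * len(codebook)
    for d in image_descriptors:
        hist[nearest(d, codebook)] += 1.0
    total = sum(hist)
    return [h / total for h in hist]

rng = random.Random(0)
# Toy stand-ins for descriptors pooled over a dataset and for one image.
pool = [[rng.random() for _ in range(8)] for _ in range(200)]
image = [[rng.random() for _ in range(8)] for _ in range(30)]

cb_random = random_codebook(pool, 5, rng)
cb_kmeans = kmeans_codebook(pool, 5, rng)
h_random = encode(image, cb_random)  # feature vector a classifier would see
h_kmeans = encode(image, cb_kmeans)
```

In both variants the image is encoded the same way; only the way the codewords are chosen differs, which is the step the paper examines. The random variant skips the iterative clustering entirely, so its cost is a single sampling pass.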

Similar resources

Multi-layer Orthogonal Visual Codebook for Image Classification

Recently, the Bag of Visual Words (BoW) model has shown its success in image classification and retrieval. The key idea behind the BoW model is to quantize the continuous high-dimensional space of image features (e.g. SIFT [1]) to a manageable visual codebook. The quality of the visual codebook has an important impact on BoW-based methods. Different from the existing techniques, such as Kernel codeb...

Polarimetric Synthetic Aperture Radar Image Classification using Bag of Visual Words Algorithm

Land cover is defined as the physical material of the surface of the earth, including different vegetation covers, bare soil, water surface, various urban areas, etc. Land cover and its changes are very important and influential on the Earth and life of living organisms, especially human beings. Land cover change monitoring is important for protecting the ecosystem, forests, farmland, open spac...

Convergence Analysis of Codebook Generation Techniques for Vector Quantization using K-Means Clustering Technique

Vector Quantization (VQ) is one of the lossy image compression techniques. VQ comprises three different phases: Codebook Generation, Image Encoding and Image Decoding. The performance of VQ is mainly based on the codebook generation phase. In this paper, five different codebook generation techniques namely the Simple Codebook Generation (SCG), Ordered Codebook Generation (OCG), Codebook Gene...

Combined Descriptors in Spatial Pyramid Domain for Image Classification

Recently, spatial pyramid matching (SPM) with the scale invariant feature transform (SIFT) descriptor has been successfully used in image classification. Unfortunately, the codebook generation and feature quantization procedures using SIFT features have high complexity in both time and space. To address this problem, in this paper, we propose an approach which combines local binary patterns (LBP)...

Locally Global Codebook for Image Retrieval and Clustering Using Vector Quantization

In this paper, an incremental codebook generation process is proposed: a technique for representing a database of images as a single codebook that captures the content of all the images. Vector quantization (VQ) is used for creating the codebook of the image. The main problem with VQ is the size of the training sequence that is used to generate the global codebook. This paper explain...

Publication date: 2014